Finding Superior Skyline Points from Incomplete Data
نویسندگان
چکیده
The skyline query has proven to be an important tool in multi-criteria decision making and search space pruning. A skyline query returns the subset of points from a multidimensional dataset that are not dominated by any other point. Due to its wide applications, skyline query and its variants have been extensively studied in the past. However, skyline computation for incomplete domain, where points have missing values for some dimensions, has not received enough attention. The existing solutions for such incomplete datasets use weak pareto dominance relation which is nontransitive and cyclic. Hence, many of the desirable points are not included in the skyline. Consequently, the skyline no longer offers a reliable overview of the dataset. Moreover, the skyline set returned by these methods is unordered and has high cardinality. The end user does not have control over the result size. Therefore, we have adapted the top-k frequent skyline approach proposed for complete datasets to find interesting points from incomplete datasets. The proposed approach overcomes the above mentioned drawbacks and returns top-k points ordered by their fractional skyline frequency. Experimental results on both synthetic and real world datasets demonstrate the ability of our approach to find superior skyline points from incomplete datasets.
منابع مشابه
Finding Skylines for Incomplete Data
In the last decade, skyline queries have been extensively studied for different domains because of their wide applications in multi-criteria decision making and search space pruning. A skyline query returns all the interesting points in a multi-dimensional data set that are not dominated by any other point with respect to all dimensions. However, real world data sets are seldom complete, i.e. d...
متن کاملProcessing Skyline Queries in Incomplete Database: Issues, Challenges and Future Trends
Corresponding Author: Ali A. Alwan Department of Computer Science, Kulliyyah of Information and Communication Technology, International Islamic University Malaysia, Kuala Lumpur 53100, Malaysia Email: [email protected] Abstract: In many contemporary database applications such as multi-criteria decision-making and real-time decision-support applications, data mining, ecommerce and recommendati...
متن کاملAn Algorithm for Retrieving Skyline Points based on User Specified Constraints using the Skyline Ordering
Given a multidimensional data set, a skyline query returns the interesting points that are not dominated by other points. The actual cardinality (s) of a skyline query result may vary substantially from the desired result cardinality (k). An approach called skyline ordering is used that forms a skyline based partitioning of a given data set, it provides an ordering among the partitions. The con...
متن کاملFinding Pareto Optimal Groups: Group-based Skyline
Skyline computation, aiming at identifying a set of skyline points that are not dominated by any other point, is particularly useful for multi-criteria data analysis and decision making. Traditional skyline computation, however, is inadequate to answer queries that need to analyze not only individual points but also groups of points. To address this gap, we generalize the original skyline defin...
متن کاملOn High Dimensional Skylines
In many decision-making applications, the skyline query is frequently used to find a set of dominating data points (called skyline points) in a multidimensional dataset. In a high-dimensional space skyline points no longer offer any interesting insights as there are too many of them. In this paper, we introduce a novel metric, called skyline frequency that compares and ranks the interestingness...
متن کامل